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| Abstract 

A multiple antenna downlink channel where limited channel feedback is available to the transmitter is considered. 
' In a vector downlink channel (single antenna at each receiver), the transmit antenna array can be used to transmit 

separate data streams to multiple receivers only if the transmitter has very accurate channel knowledge, i.e., if there 
is high-rate channel feedback from each receiver. In this work it is shown that channel feedback requirements can 
. be significantly reduced if each receiver has a small number of antennas and appropriately combines its antenna 

outputs. A combining method that minimizes channel quantization error at each receiver, and thereby minimizes 
multi-user interference, is proposed and analyzed. This technique is shown to outperform traditional techniques 
such as maximum-ratio combining because minimization of interference power is more critical than maximization 
of signal power in the multiple antenna downlink. Analysis is provided to quantify the feedback savings, and the 
technique is seen to work well with user selection and is also robust to receiver estimation error. 
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I. Introduction 



\ Multi-user MIMO techniques such as zero-forcing beamforming allow for simultaneous transmission of multiple 
£> 1 data streams even when each receiver (mobile) has only a single antenna, but very accurate channel state information 
. (CSI) is generally required at the transmitter in order to utilize such techniques. In the practically motivated finite 
^ \ rate feedback model, each mobile feeds back a finite number of bits describing its channel realization at the 
j— ( ' beginning of each block or frame. In the vector downlink channel (multiple transmit antennas, single antenna at 
■ each receiver), the feedback bits are determined by quantizing the channel vector to one of 2 B quantization vectors. 
While a relatively small number of feedback bits suffice to obtain near-perfect CSIT performance in a point-to- 
point vector/MISO (multiple-input, single-output) channel [1], considerably more feedback is required in a vector 
downlink channel. If zero-forcing beamforming (ZFBF) is used, the feedback rate must be scaled with the number 
. £h | of transmit antennas as well as SNR in order to achieve rates close to perfect CSIT systems [2]. In such a system 
^ ' the transmitter emits multiple beams and uses its channel knowledge to select beamforming vectors such that nulls 
^ . are created at certain users. Inaccurate CSI leads to inaccurate nulling and thus translates directly into multi-user 
interference and reduced SINR/throughput. 

In this paper we consider the MIMO downlink channel, in which the transmitter and each mobile have multiple 
antennas (M transmit antennas, N antennas per mobile), in the same limited feedback setting. We propose a receive 
antenna combining technique, dubbed quantization-based combining (QBC), that converts the MIMO downlink into 
a vector downlink in such a way that the system is able to operate with reduced channel feedback. Each mobile 
linearly combines its N antenna outputs and thereby creates a single antenna channel. The resulting vector channel 
is quantized and fed back, and transmission is then performed as in a normal vector downlink channel. 

With QBC the combiner weights are chosen on the basis of both the channel and the vector quantization codebook 
to produce the effective single antenna channel that can be quantized most accurately. On the other hand, traditional 
combining techniques such as the maximum-ratio based technique that is optimal for point-to-point MIMO channels 
with limited channel feedback [3] or direct quantization of the maximum eigenmode are aimed towards maximization 
of received signal power but generally do not minimize channel quantization error. Since channel quantization error is 
so critical in the MIMO downlink channel, quantization-based combining leads to better performance by minimizing 
quantization error (i.e., interference power) possibly at the expense of channel (i.e., signal) power. 

One way to view the advantage of QBC is through its reduced feedback requirements relative to the vector 
downlink channel. In [2] it is shown that scaling (per mobile) feedback as B = M ~ 1 PdB, where P represents the 
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SNR, suffices to maintain a maximum gap of 3 dB (equivalent to 1 bps/Hz per mobile) between perfect CSIT and 
limited feedback performance in a vector downlink channel employing ZFBF. With QBC, our analysis shows that 
the same throughput (3 dB away from a vector downlink with perfect CSIT) can be achieved if feedback is scaled 
at the slower rate of B M ^ N PdB- In other words, QBC allows a MIMO downlink to mimic vector downlink 
performance with reduced channel feedback. 

Alternatively, QBC can be thought of as an effective method to utilize multiple receive antennas in a downlink 
channel in the presence of limited channel feedback. Although it is possible to send multiple streams to each mobile 
if receive combining is not performed, this requires even more feedback from each mobile than a single-stream 
approach. In addition, QBC has the advantage that the transmitter need not be aware of the number of receive 
antennas being used. 

The remainder of this paper is organized as follows: In Section [II] we introduce the system model and some 
preliminaries. In Section HIT] we describe a simple antenna selection method that leads directly into Section [IV] 
where the much more powerful quantization-based combining technique is described in detail. In Section |V] we 
analyze the throughput and feedback requirements of QBC. In Section [VI] we compare QBC to alternative MIMO 
downlink techniques, and finally we conclude in Section I VII I 

II. System Model and Preliminaries 

We consider a K mobile (receiver) downlink channel in which the transmitter (access point) has M antennas, 
and each of the mobiles has N antennas. The received signal at the i-th antenna is given by: 

yi = hfx + ni, i = l,..., NK (1) 

where hi, Y12, ■ ■ ■ , iiKN are the channel vectors (with hj € C Mxl ) describing the KN receive antennas, x € C Mxl 
is the transmitted vector, and . . . , riNK are independent complex Gaussian noise terms with unit variance. The 
/c-th mobile has access to J/(fe_i)jv+i> • • • iVNk- The input must satisfy a power constraint of P, i.e. .E[||x|| 2 ] < P. 
We use Hfc to denote the concatenation of the fc-fh mobile's channels, i.e. H^. = [h( fc _i)jy + i • • • h/vfc]- We consider 
a block fading channel with iid Rayleigh fading from block to block, i.e., the channel coefficients are iid complex 
Gaussian with unit variance. Each of the mobiles is assumed to have perfect knowledge of its own channel Hj, 
although we analyze the effect of relaxing this assumption in Section IV-CI In this work we study only the ergodic 
capacity, or the long-term average throughput. Furthermore, we only consider systems for which N < M because 
QBC is not very useful if N > M; this point is briefly discussed in Section [TV] 

A. Finite Rate Feedback Model 

In the finite rate feedback model, each mobile quantizes its channel to B bits and feeds back the bits perfectly 
and instantaneously to the transmitter at the beginning of each block [3] [4]. Vector quantization is performed using 
a codebook C of 2 B M-dimensional unit norm vectors C = {wi, . . . , w 2 b}, and each mobile quantizes its channel 
to the quantization vector that forms the minimum angle to it [3] [4]: 

h k = arg min sin 2 (Z(h fe , w)) . (2) 

W=Wi,...,W 2 B 

For analytical tractability, we study systems using random vector quantization (RVQ) in which each of the 2 B 
quantization vectors is independently chosen from the isotropic distribution on the M-dimensional unit sphere and 
where each mobile uses an independently generated codebook [5]. We analyze performance averaged over random 
codebooks; similar to Shannon's random coding argument, there always exists at least one quantization codebook 
that performs as well as the ensemble average. 

B. Zero-Forcing Beamforming 

After receiving the quantization indices from each of the mobiles, the AP can use zero-forcing beamforming 
(ZFBF) to transmit data to up to M users. For simplicity let us consider the N = 1 scenario, where the channels are 
the vectors hi, . . . , hjv/- When ZFBF is used, the transmitted signal is defined as x = J2k=i x k v k, where each x k 
is a scalar (chosen complex Gaussian) intended for the A;-th mobile, and v& £ C M is the k-th mobile's BF vector. 
If there are M mobiles (randomly selected), the beamforming vectors vi, ... , vm are chosen as the normalized 
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rows of the matrix [hi ■ ■ ■ h^] -1 , i.e., they satisfy ||vfc|| = 1 for all k and hj^Vj = for all j / k. If all multi-user 
interference is treated as additional noise and equal power loading is used, the resulting SINR at the fc-th receiver 
is given by: 

p_\Y 1 h v |2 

SINR k = Ml fc p fc ' — -. (3) 

The coefficient that determines the amount of interference received at mobile k from the beam intended for mobile 
j, |h^Vj | 2 , is easily seen to be an increasing function of mobile &'s quantization error. 

In the above expression we have assumed that M mobiles are randomly selected for transmission and that equal 
power is allocated to each mobile. However, the throughput of zero-forcing based MIMO downlink channels can 
be significantly increased by transmitting to an intelligently selected subset of mobiles [6]. In order to maximize 
throughput, users with nearly orthogonal channels and with large channel magnitudes are selected, and waterfilling 
can be performed across the channels of the selected users. In [7] a low-complexity greedy algorithm that selects 
users and performs waterfilling is proposed. If this algorithm is used, a zero-forcing based system can come quite 
close to the true sum capacity of the MIMO downlink, even for a moderate number of users. 



C. MIMO Downlink with Single Antenna Mobiles 

In [2] the vector downlink channel (N = 1) is analyzed assuming that equal power ZFBF is performed without 
user selection on the basis of finite rate feedback (with RVQ). The basic result of [2] is that: 



Rfb(P) > Rcsit(P) - log 2 ( 
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where Rfb{P) and Rcsit(P) are the ergodic per-user throughput with feedback and with perfect CSIT, respec- 
tively, and the quantity E 



sm 



(z(h fc ,h fc )) 



is the expected quantization error. The expected quantization error 

can be accurately upper bounded by 2~ «-i and therefore the throughput loss due to limited feedback is upper 
bounded by log 2 (i+ p .2~^y which is an increasing function of the SNR P. If the number of feedback bits 
(per mobile) is scaled with P according to: 

S = (M-l)log 2 P 



M — 1 



-P 



dBi 



then the difference between Rfb{P) and Rcsit(P) is upper bounded by 1 bps/Hz at all SNR's, or equivalently 
the power gap is at most 3 dB. As the remainder of the paper shows, quantization-based combining significantly 
reduces the quantization error (more precisely, it increases the exponential rate at which quantization error goes to 
zero as B is increased) and therefore decreases the rate at which B must be increased as a function of SNR. 



III. Antenna Selection for Reduced Quantization Error 

In this section we describe a simple antenna selection method that reduces channel quantization error. Description 
of this technique is primarily included for expository reasons, because the simple concept of antenna selection 
naturally extends to the more complex (and powerful) QBC technique. In point-to-point MIMO, antenna selection 
corresponds to choosing the receive antenna with the largest channel gain, while in the MIMO downlink the receive 
antenna that can be vector quantized with minimal angular error is selected. Mobile 1, which has channel matrix 
Hi = [hi-.-hjv] and a single quantization codebook consisting of 2 B quantization vectors wi, . . . ,w 2 b, first 
individually quantizes each of its N vector channels hi, . . . , h/v 

gi = arg min sin 2 (Z(h i; w)) i = l,...,N, (5) 

W=Wi,...,W 2 B 

and then selects the antenna with the minimum quantization error: 

j = arg min sin 2 (Z(h;, &)) , (6) 

i=l,..., N 

and feeds back the quantization index corresponding to gZ The mobile uses only antenna j for reception, and thus 
the system is effectively transformed into a vector downlink channel. 
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Due to the independence of the channel and quantization vectors, choosing the best of N channel quantizations 
is statistically equivalent to quantizing a single vector channel using a codebook of size N ■ 2 B . Therefore, antenna 
selection effectively increases the quantization codebook size from 2 B to N ■ 2 B , and thus the system achieves the 
same throughput as a vector downlink with B + log 2 N feedback bits. Although not negligible, this advantage is 
much smaller than that provided by quantization-based combining. 

IV. Quantization-Based Combining 

In this section we describe the quantization-based combining (QBC) technique that reduces channel quantization 
error by appropriately combining receive antenna outputs. We consider a linear combiner at each mobile, which 
effectively converts each multiple antenna mobile into a single antenna receiver. The combiner structure for a 3 
user channel with 3 transmit antennas (M = 3) and 2 antennas per mobile (JV = 2) is shown in Fig. Q] Each 
mobile linearly combines its N outputs, using appropriately chosen combiner weights, to produce a scalar output 
(denoted by y| ff ). The effective channel describing the channel from the transmit antenna array to the effective 
output of the k-th mobile (y| ff ) is simply a linear combination of the N vectors describing the N receive antennas. 
After choosing combining weights the mobile quantizes the effective channel vector and feeds back the appropriate 
quantization index. Only the effective channel output is used to receive data, and thus each mobile effectively has 
only one antenna. 

The key to the technique is to choose combiner weights that produce an effective channel that can be quantized 
very accurately; such a choice must be made on the basis of both the channel vectors and the quantization codebook. 
This is quite different from maximum ratio combining, where the combiner weights and quantization vector are 
chosen such that received signal power is maximized but quantization error is generally not minimized. Note that 
antenna selection corresponds to choosing the effective channel from the N columns of while QBC allows for 
any linear combination of these N column vectors. 



A. General Description 

Let us consider the effective received signal at the first mobile for some choice of combiner weights, which we 
denote as 7 1 = (71 1, • • • , 7i,jv)- In order to maintain a noise variance of one, the combiner weights are constrained 
to have unit norm: ||7i|| = 1. The (scalar) combiner output, denoted y| ff , is: 

N / N \ N 

i=l \i=l / 1=1 

= (hf) ff x + n, 

where n = J2iLilii n i * s UIUt variance complex Gaussian because I7-J = 1. The effective channel vector hf 1 is 
simply a linear combination of the vectors hi, ... , hjv: hp = Y2i=i 7i,*hi = Hi7i- Since 7 1 can be any unit norm 
vector, hp can be in any direction in the A-dimensional subspace spanned by hi, ... , hjy, i.e., in span(Hi)Q 

Because quantization error is so critical to performance, the objective is to choose combiner weights that yield 
an effective channel that can be quantized with minimal error. The error corresponding to effective channel hf { is 

jnin sin 2 (Z(hf , wj)) • (V) 



'By well known properties of iid Rayleigh fading, the matrix Hi is full rank with probability one [8], 
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Therefore, the optimal choice of the effective channel is the solution to: 

.2 / //ueff 



mm 

hf 1-- 



mm B Sm 2 (Z(hf,wO), (8) 

1,...,2 



where h^ ff is allowed to be in any direction in span(Hi). Once the optimal effective channel is determined, the 
combiner weights -f 1 can be determined through a simple pseudo-inverse operation. 

Since the expression for the optimum effective channel given in ([8]) consists of two minimizations, without loss 
of optimality the order of the minimization can be switched to give: 

min min sin 2 ( Z(hi ff , wi)) , (9) 

1=1,. ..,2 B hf 

For each quantization vector w/, the inner minimization finds the effective channel vector in span(Hi) that forms 
the minimum angle with wj. By basic geometric principles, the minimizing hf is the projection of w; on span(Hi). 
The solution to the inner minimization in © is therefore the sine squared of the angle between and its projection 
on span(Hi), which is referred to as the angle between wj and the subspaceH. As a result, the best quantization 
vector, i.e., the solution of (©, is the vector that forms the smallest angle between itself and span(Hi). The optimal 
effective channel is the (scaled) projection of this particular quantization vector onto span(Hi). 

In order to perform quantization, the angle between each quantization vector and span(Hi) must be computed. 
If qi,...,qjv form an orthonormal basis for span(Hi) and Qi = [qi • • ■ qjv]> then sin 2 (Z(w, span(Hi))) = 
1 — ||Q{^w|| 2 . Therefore, mobile l's quantized channel, denoted hi, is: 

hi = arg min |Z(w, span(Hi))| = arg max ||Qfw|| 2 . (10) 

W=Wi,...,W 2 B W=Wi,...,W 2 B 

Once the quantization vector has been selected, it only remains to choose the combiner weights. The projection 
of hi on span(Hi), which is equal to QiQ^hi, is scaled by its norm to produce the unit norm vector s^ 0> . The 
direction specified by s^ r0J has the minimum quantization error amongst all directions in span(Hi), and therefore 
the effective channel should be chosen in this direction. First we find the vector ui G C N such that Hiui = sf 0} , 
and then scale to get ~y 1 . Since s^ 10J is in span(Hi), ui is uniquely determined by the pseudo-inverse of Hi: 

ui^HfH^Hfsr, (11) 

and the combiner weight vector *y 1 is the normalized version of ui: 7 = tt^tt- The quantization procedure is 
illustrated for a N = 2 channel in Fig. [2] In the figure the span of the two channel vectors is shown along with 
the quantization vector hi, its projection on the channel subspace, and the effective channel. 



B. Algorithm Summary 

We now summarize the quantization-based combining procedure performed at the k-th mobile: 

1) Find an orthonormal basis, denoted qi, . . . , q^r, for span(Hfc) and define = [qi • • • qjv]. 

2) Find the quantization vector closest to the channel subspace: 

hfc = arg max ||Q^w|| 2 . (12) 

W=Wi,...,W 2 fl 

3) Determine the direction of the effective channel by projecting h^ onto span(Hfc). 

nroj = QfcQfhfc 

HQfcQfhfcH 

4) Compute the combiner weight vector -y k : 

(HfH^HgsT 
IKHfHj-'H^ril' 

2 If the number of mobile antennas is equal to the number of transmit antennas (N = M), the channel vectors span C M with probability 
one. Therefore, each quantization vector has zero angle with the channel subspace and as a result the solution to the inner minimization 
in l|9} is trivially zero for each w;. Thus, performing quantization with the sole objective of minimizing angular error (i.e., QBC) is not 
meaningful when N = M and is therefore not studied here. 



Fig. 2. Quantization procedure for a two antenna mobile 



Each mobile performs these steps, feeds back the index of its quantized channel h^, and then linearly combines 
its N received signals using vector 7 fe to produce its effective channel output yf 1 = (h| ff )^x+n with h.^ = Hklk- 
Note that the transmitter need not be aware of the number of receive antennas or of the details of this procedure 
because the downlink channel appears to be a single receive antenna channel from the transmitter's perspective; 
this clearly eases the implementation burden of QBC. 



V. Throughput Analysis 

Quantization-based combining converts the MIMO downlink channel into a vector downlink with channel vectors 
h^ ff , . . . , and channel quantizations hj • • • h^- We first derive the statistics of the effective vector channel, then 
analyze throughput for ZFBF with equal power loading and no user selection, and finally quantify the effect of 
receiver estimation error. 



A. Channel Statistics 

We first determine the distribution of the quantization error and the effective channel vectors with respect to both 
the random channels and random quantization codebooks. 

Lemma 1: The quantization error sin 2 (Z(hfc, h| ff )), is the minimum of 2 B independent beta (M— N, N) random 
variables. 

Proof: If the columns of MxN matrix form an orthonormal basis for span(H ( ! c ), then cos 2 (Z(w/, span(H&)) 
||Q^w/|| 2 for any quantization vector. Since the basis vectors and quantization vectors are isotropically chosen and 
are independent, this quantity is the squared norm of the projection of a random unit norm vector in C M onto a 
random A-dimensional subspace, which is described by the beta distribution with parameters iV and M — N [9]. 
By the properties of the beta distribution, sin 2 (Z(w;, span(H^)) = 1 — cos 2 (Z(w^, span(Hfc)) is beta (M — N, N). 
Finally, the independence of the quantization and channel vectors implies independence of the 2 B random variables. 

■ 

Lemma 2: The normalized effective channels ^ h la^ , ■ ■ ■ , pffijj are iid isotropic vectors in C . 

Proof: From the earlier description of QBC, note that m^Sfit = which is the projection of the best 

quantization vector onto span(H^). Since each quantization vector is chosen isotropically, its projection is isotropi- 
cally distributed within the subspace. Furthermore, the best quantization vector is chosen based solely on the angle 
between the quantization vector and its projection. Thus s| roj is isotropically distributed in span(Hfc). Since this 
subspace is also isotropically distributed, the vector s£ roj is isotropically distributed in C M . Finally, the independence 
of the quantization and channel vectors from mobile to mobile implies independence of the effective channel 
directions. ■ 
Lemma 3: The quantity ||h^ ff || 2 is X^m- n+D' 

Proof: Using the notation from Section HV-AI the norm of the effective channel is given by: 



-,P ro J i 

lUfcH" MUftir" ii u fclh ii u /ci 



|hf || 2 = ||H fc7fc || 2 = ||H fc ^y 2 = ^||H fc u fc || 2 = = (15) 



where we have used the definitions h^ ff = Hklk an ^ 7fc = TTu^TT' an( ^ tne ^ act tnat Ufc sat i sr i es H^Ufc = s^ r0J . 
Therefore, in order to characterize the norm of the effective channel it is sufficient to characterize n 1 ii 2 - The A^- 
dimensional vector is the set of coefficients that allows s? r0J , the normalized projection of the chosen quantization 
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vector, to be expressed as a linear combination of the columns of (i.e., the channel vectors). Because s? 
is isotropically distributed in span(H^) (Lemma [2), if we change coordinates to any (iV-dimensional) basis for 
span(Hfc) we can assume without loss of generality that the projection of the quantization vector is [1 • • • 0] T . 
Therefore, the distribution of ^jjp is the same as the distribution of ^pp^rpij — ■ Since the N x N matrix H^H^ 

is Wishart distributed with M degrees of freedom, this quantity is well-known to be X^fM-N+l) ' see ^ or a 
proof. ■ 
The norm of the effective channel has the same distribution as that of a (M — N + l)-dimensional random vector 
instead of a M-dimensional vector. An arbitrary linear combination (with unit norm) of the N channel vectors 
would result in another iid complex Gaussian M-dimensional vector, whose squared norm is x| M , but the weights 
defining the effective channel are not arbitrary due to the inverse operation. 



B. Sum Rate Performance Relative to Perfect CSIT 

After receiving the quantization indices from each of the mobiles, a simple transmission option is to perform 
equal-power ZFBF based on the channel quantizations (as described in Section III-Bb . If K = M or K > M and 
M users are randomly selected, the resulting SINR at the fc-th mobile is given by: 

P_ \(y.ett\H^ r |2 

SINR k = A/K V !' „ • (16) 

The ergodic sum rate achieved by QBC, denoted Rqbc{P)> is therefore given by: 



Rqbc(P) — %,w 



log 2 1 + 



&l(hf) H Vfc| 2 
1 + Eft*£l(hf)*v i w 



3 I 

where the expectation is taken with respect to the fading and the random quantization codebooks. 

In order to study the benefit of QBC we compare Rqbc{P) to the sum rate achieved using zero-forcing 
beamforming on the basis of perfect CSIT in an M transmit antenna vector downlink channel (single receive 
antenna), denoted Rzf-csit{P)- We use the vector downlink with perfect CSIT as the benchmark because QBC 
converts the system into a vector downlink, and the rates achieved by QBC cannot exceed Rzf-csit{P) (even as 
B — ► oo). We later describe how this metric can easily be translated into a comparison between Rqbc(P) and the 
sum rate achievable with linear precoding (i.e., block diagonalization) in an N receive antenna MIMO downlink 
channel with CSIT. 

In a vector downlink with perfect CSIT, the BF vectors (denoted vzF,k) can be chosen perfectly orthogonal to 
all other channels. Thus, the SNR of each user is as given in (0 with zero interference terms in the denominator 
and the resulting average rate is: 



Rzf~csit{P) = Eh 



log 2 (l + ^MvzfA 2 ) 



Following the procedure in [2], the rate gap AR(P) is defined as the difference between the per-user throughput 
achieved with perfect CSIT and with feedback-based QBC: 

AR(P) 4 RzF _ CSIT ( P ) _ R QBC {P). (17) 

Similar to Theorem 1 of [2], we can upper bound this throughput loss: 
Theorem 1: The per-user throughput loss is upper bounded by: 

/ A ^ l\ / fM-N+l\ , n, , - offN 



AR(P) < ^ - log 2 e + log 2 ll+P —— E [sm'(Z(h k ,hf))} 

\l=M-N+l J ^ ^ ' 

Proof: See Appendix. ■ 

The first term in the expression is the throughput loss due to the reduced norm (Lemma [3]) of the effective channel, 

while the second (more significant) term, which is an increasing function of P, is due to quantization error. In 

order to quantify this rate gap, the expected quantization error needs to be bounded. By Lemma [T] the quantization 
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error is the minimum of 2 B iid beta(M — N, N) RV's. Furthermore, a general result on ordered statistics applied 
to beta RV's gives [9, Chapter 4.I.B]: 

^[ S in 2 (Z(h fc ,hf))]<F x 1 (2- B ) 
where Fx(x) is the inverse of the CDF of a beta (M — N,N) random variable, which is: 

F X ( X ) = £(^)*^^ 

where the approximation is the result of keeping only the lowest order x term and dropping (1 — x) terms; this is 
valid for small values of x. Using this we get the following approximation: 

n » te __B_ ( Ad — l\ ~~M^~N 

E[ S m 2 (Z(h k ,hf))]^2-—^ N _ i j . (18) 

The accuracy of this approximation is later verified by our numerical results. Plugging this approximation into the 
upper bound in Theorem Q] we get: 




AR(P) - | >; ,| 1 „ E2e + lo g2 ( 1+ P.(™l)2^(^: i 1 )""-"j (19) 

If B is fixed, quantization error causes the system to become interference-limited as the SNR is increased (see [2, 
Theorem 2] for a formal proof when N = 1). However, if B is scaled with the SNR P such that the quantization 
error decreases as -p, the rate gap in ( fT9l ) can be kept constant and the full multiplexing gain (M) is achieved. In 
order to determine this scaling, we set the approximation of AR(P) in (fT9l equal to a rate constant log 2 b and solve 
for B as a function of P. Thus, a per-mobile rate loss of at most log 2 b (relative to Rzf-csit(P)) is maintained 
if B is scaled as: 

B N « (M - N) log 2 P-{M-N) log 2 c-(M-N) log 2 ( M _^ +1 ) ~ log 2 ~* 

M-N ( M \ (M-\\ 

~ —5— PdB-{M-N)\og 2 c-{M-N)\og 2 ( ^ — -log 2 AT J, (20) 



M-N+1J ^\N-l 

_ i m — l l \ 

where c = b - e ^i=m-n+i V — I. Note that a per user rate gap of log 2 b = 1 bps/Hz is equivalent to a 3 dB power 
gap in the sum rate curves. 

As discussed in Section [Tl-CI scaling feedback in a single receive antenna downlink as B\ = PdB maintains 
a 3 dB gap from perfect CSIT throughput. Feedback must also be increased linearly if QBC is used, but the slope of 
this increase is ~ when mobiles have only a single antenna compared to a slope of M ~ N for antenna combining. 
If we compute the difference between the N = 1 feedback load and the QBC feedback load, we can quantify how 

much less feedback is required to achieve the same throughput (3 dB away from a vector downlink channel with 
perfect CSIT) if QBC is used with N antennas/mobile: 

^qbc(N) = B 1 -B N ^ ^j^PdB + log 2 r£Z^) - - !) lQ g2 e. 

The sum rate of a 6 transmit antenna downlink channel (M = 6) is plotted in Fig. [3] The perfect CSIT zero- 
forcing curve is plotted along with the rates achieved using finite rate feedback with B scaled according to (l20l for 
iV = 1, 2 and 3. For N = 2 and N = 3 QBC is performed and the fact that the throughput curves are approximately 
3 dB away from the perfect CSIT curve verify the accuracy of the approximations used to derive the feedback 
scaling expression in d20b - In this system, the feedback savings at 20 dB are 7 and 12 bits, respectively, for 2 and 
3 receive antennas. All numerical results in the paper are generated using the method described in Appendix ITT] 

It is also important to compare QBC throughput to the throughput of a MIMO downlink channel with N antennas 
per mobile. The most meaningful comparison is to the rate achievable with block diagonalization (BD) [11] without 



9 




SNR (dB) 

Fig. 3. Sum rate of M = K — 6 downlink channel 

user selection and with equal power loading. In this case, 4£ mobiles are transmitted to (with N data streams per 
mobile). In [12] it is shown that the BD sum rate is 

jy- N-l jy _ ■ 

A BD ^ ZF (N) = (log 2 e)— —r 1 

i=i 3 

larger than Rzf-csit{P) at asymptotically high SNR, and that this offset is accurate even for moderate SNR's. 
This can be translated to a power offset by multiplying by to give 31( ]| 2 6 ^2f=i~ —j-^ dB, which equates to 2.16 
dB and 3.61 dB for N = 2 and N = 3. Therefore, the rate offset between QBC and BD with CSIT is the sum of 
AR(P) (equation fTT]) and Abd-zf(N). In Fig. [3] the BD sum rate curves are plotted, and their shifts relative to 
ZF-CSIT are seen to follow the predicted power gaps. 

C. Effect of Receiver Estimation Error 

Although the analysis until now has assumed perfect CSI at the mobiles, a practical system always has some 
level of receiver error. We consider the scenario where a shared pilot sequence is used to train the mobiles. If (3M 
downlink pilots are used (/3 > 1 pilots per transmit antenna), channel estimation at the k-th mobile is performed on 
the basis of observation G& = \/ f3PHk + n^. The MMSE estimate of is = ^f p Gk, and the true channel 
matrix can be written as the sum of the MMSE estimate and independent estimation error: 

H fc = G fc + e fc , (21) 

where is white Gaussian noise, independent of the estimate Gfc, with per-component variance (1 + /3P) -1 . After 
computing the channel estimate G^, the mobile performs QBC on the basis of the estimate Gfc to determine the 
combining vector f k . As a result, the quantization vector very accurately quantizes the vector Gfc7 fc , which is 
the mobile's estimate of the effective channel output, while the actual effective channel is given by h| ff = Hk'Jk- 
For simplicity we assume that coherent communication is possible, and therefore the long-term average throughput 
is again P[log 2 (l + SINR^)] where the same expression for SINR given in (fT6l ) applied The general throughput 
analysis in Section [V] still applies, and in particular, the rate gap upper bound given in Theorem Q] still holds 
if the expected quantization error takes into account the effect of receiver noise. As shown in Appendix JIIJ the 
approximate rate loss with receiver error is: 

- *-(JL3M w -(t)^(£0^4 <22) 

3 We have effectively assumed that each mobile can estimate the phase and SINR at the effective channel output. In practice this could be 
accomplished via a second round of pilots as described in [13]. 
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Fig. 4. Combining with Imperfect CSIR: M = 4, N = 2, K = 4, B scaled with SNR 

Comparing this expression to ( fl9l ) we see that estimation error leads only to the introduction of an additional ^ term. 
If feedback is scaled according to (l20l ) the rate loss is log 2 (6 + /3 _1 ) rather than log 2 (6). In Figure [4] the throughput 
of a 4 mobile system with M = 4 and N = 2 is plotted for perfect CSIT/CSIR and for QBC performed on the 
basis of perfect CSIR (f3 = oo) and imperfect CSIR for (3 = 1 and [5 = 2. Estimation error causes non-negligible 
degradation, but the loss decreases rather quickly with /? (which can be increased at a reasonable resource cost 
because pilots are shared). 



In this section we compare the throughput of QBC to other receive combining techniques and to limited feedback- 
based block diagonalizatior@. For all results on receiving combining, the user selection algorithm of [7] is applied 
assuming limited feedback (B bits) regarding the direction of the effective channel and perfect knowledge of the 
effective channel nor ml We first describe these alternative approaches and then discuss some numerical results. 

A. Alternate Combining Techniques 

The optimal receive combining technique for a point-to-point MIMO channel in a limited feedback setting is to 
select the quantization vector that maximizes received power [3]: 



Because this method roughly corresponds to maximum ratio combining, it is referred to as MRC. If BF vector w 
is used by the transmitter, received power is maximized by choosing 7 = || H jj w || [3], which yields = Hk7k = 

niru h it ■ When B is not very small, with high probability the quantization vector that maximizes ||H^ w|| is the 
vector that is closest to the eigenvector corresponding to the maximum eigenvalue of H^H^ . To see this, consider 
the maximization of ||H^w|| when w is constrained to have unit norm but need not be selected from a finite 
codebook. This corresponds to the classical definition of the matrix norm, and the optimizing w is in the direction 
of the maximum singular value of When B is not too small, the quantization error is very small and as a 
result the solution to d23l is extremely close to ||Hfc|| 2 . As a result, selecting the quantization vector according to 
the criteria in A23\) is roughly equivalent to directly finding the quantization vector that is closest to the direction 
of the maximum singular value of 

4 It should be noted that comparisons with block diagonalization are somewhat rough because systems that perform BD on the basis of 
limited feedback and that employ user/stream selection have not yet been extensively studied in the literature, to the best of our knowledge. 
As a result, it may be possible to improve upon the BD systems we use here as the point of comparison. 

'Although the rate gap upper bound derived in Theorem Q] only rigorously applies to systems with equal power loading and random 
selection of M mobiles, the bound can be used to reasonably approximate the throughput degradation due to limited feedback even when 
user selection is performed. See [14] for a further discussion of the effect of limited feedback on systems employing user selection. 



VI. Performance Comparisons 



hfc = arg max 




(23) 
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Fig. 5. Quantization Error for Different Combining Techniques (M = 4, N = 2) 





Effective Channel Norm 


Quantization Error 


Single RX Antenna (TV = 1) 


2 

X2M 


2 -B/(M-i) 


Antenna Selection 


2 

X2M 




MRC 


~ max eigenvalue 


2 ~B/(M-1) 


Max Eigenvector 


max eigenvalue 


2 -B/(M-t) 


QBC 


2 

X2(M-JV+1) 


2 -B/(M-N) 



TABLE I 

Summary of Combining Techniques 




The maximum singular value of can be directly quantized if the mobile first selects the combiner weights j k 
such that the effective channel h| ff = Hklk * s i n tne direction of the maximum singular value, which corresponds 
to selecting *y k equal to the eigenvector corresponding to the maximum eigenvalue of the N x N matrix H^H^, 
and then finds the quantization vector closest to h| ff . The effective channel norm satisfies ||h^ ff || 2 = ||Hfc|| 2 , which 
can be reasonably approximated as a scaled version of a Xzmn ran dom variable [15]. Therefore the norm of the 
effective channel is large, but notice that the quantization procedure reduces to standard vector quantization, for 

_ B 

which the error is roughly 2 M l . 

In Figure [51 numerically computed values of the quantization error (log 2 (-E'[sin 2 (Z(h| ff , h^))]) are shown for 
QBC, antenna selection, MRC (corresponding to equation |23t, and direct quantization of the maximum eigenvector, 

B J— I 

along with approximation 2 J/ l as well as the approximation from ( fl81 ), for a M = 4, N = 2 channel. Note that 
the error of QBC is very well approximated by ( fl~8l ), and the exponential rate of decrease of the other techniques 

_ B 

are all well approximated by 2 *f-i. 

Each combining technique transforms the MIMO downlink into a vector downlink with a modified channel norm 
and quantization error. These techniques are summarized in Table U The key point is that only QBC changes 
the exponent of the quantization erroi^, which determines the rate at which feedback increases with SNR. When 
comparing these techniques note that the complexity of QBC and MRC are essentially the same: QBC and MRC 
require computation of ||Q|^w|| 2 and ||Hj^w|| 2 , respectively. 

B. Block Diagonalization 

An alternative manner in which multiple receive antennas can be used is to extend the linear precoding structure 
of ZFBF to allow for transmission of multiple data streams to each mobile. Block diagonalization (BD) selects 

6 An improvement over QBC is to choose the quantization vector and combining weights that maximize the expected received SINR (the 
true SINR depends on the BF vectors, which are unknown to the mobile). This extension of QBC, which will surely outperform QBC and 
MRC, has been under investigation by other researchers since the initial submission of this manuscript and the results will be published 
shortly [16]. 
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Fig. 6. Different Combining Techniques: M = 4, N = 2, K = 4, B scaled with SNR 

precoding matrices such multi-user interference is eliminated at each receiver, similar to ZFBF. In order to select 
appropriate precoding matrices, the transmitter must know the iV-dimensional subspace spanned by each mobile 
channel H^. Thus an appropriate feedback strategy is to have each mobile quantize and feedback its channel 
subspace. The effect of limited feedback in this setting (assuming there are mobiles and equal power loading 
across users and streams is performed) was studied in [17]. In order to achieve a bounded rate loss relative to a 
perfect CSIT (BD) system, feedback (per mobile) needs to scale approximately as N(M — N) log 2 P- Thus, the 
aggregate feedback load summed over mobiles is approximately M(M — N) log 2 P, which is (approximately) 
the same as the aggregate feedback in a QBC system in which each of the M mobiles uses B as (M — N) log 2 P. 
Thus, there is a rough equivalence between QBC and BD in terms of feedback scaling, and this is later confirmed 
by our numerical results. 

It is also possible to perform user and stream selection when BD is used, and [18] presents an extension of the 
algorithm of [7] to the multiple receive antenna setting (referred to as maximum eigenmode transmission, or MET). 
In essence, MET treats each mobile's N eigenmodes as a different single antenna receiver and selects eigenmodes 
in a greedy fashion using the approach of [7]. Thus, in a limited feedback setting a reasonable strategy is to have 
each user separately quantize the directions of its N eigenvectors and also feed back the corresponding eigenvalues. 

C. Numerical Results 

In Figures [6] and [7] throughput curves are shown for a 4 transmit antenna, 2 receive antenna (M = 4, N = 2) 
system with K = 4 mobiles. Sum rate is plotted for three different combining techniques (QBC, antenna selection, 
and MRC) and for a vector downlink channel (N = 1); the BD curves are discussed in later paragraphs. In Fig. 
|6l B (per mobile) is scaled according to (l20l ). i.e., roughly as (M — N) log 2 P, while in Fig. [7] each mobile uses 
10 bits of feedback. As expected, the throughput of antenna selection, MRC, and the single antenna system all lag 
behind QBC in Fig. [6j particularly at high SNR. This is because the (M — N) log 2 P scaling of feedback is simply 
not sufficient to maintain good performance if these techniques are used. To be more precise, the quantization 
error goes to zero slower than -p which corresponds to interference power that increases with SNR, and thus a 
reduction in the slope (i.e., multiplexing gain) of these curves. In Fig. |7J MRC outperforms QBC for SNR less 
than approximately 12 dB because signal power is more important than quantization error (i.e., interference power), 
i.e., the system is not yet interference-limited. However, at higher SNR's QBC outperforms MRC because of the 
increased importance of quantization error. 

Figures [6] and [7] also include plots of the throughput of a BD system. In this system, 2 of the 4 users are randomly 
selected to feedback subspace information, and equal power BD with no selection is used to send 2 streams to each 
of these mobiles, for a total of 4 streams. In order to equalize the aggregate feedback load, each of the 2 users 
is allocated double the feedback budget of the combining-based systems; this corresponds to using two times the 
scaling of (l20l) in Fig. [6] and 20 bits per mobile in Fig. [7] BD performs slightly better than QBC in both figures, 
but we later see that this advantage is lost for larger K. 
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Fig. 7. Different Combining Techniques: M = 4, N = 2, K = 4, B = 10 

Figures [8] displays throughput for a 4 transmit antenna, 2 receive antenna (M = 4, N = 2) system at 10 dB 
against K, the number of mobiles. Capacity refers to the sum capacity of the system (with CSIT), MET-CSIT is the 
throughput achieved using the MET algorithm on the basis of CSIT[18], and ZF-CSIT is the throughput of a vector 
downlink with CSIT and user selection [7]. Below these are four limited feedback curves for 10 bits of feedback 
per mobile. The first three, QBC, MRC, and antenna selection, correspond to different combining techniques, while 
MET-FB corresponds to performing MET on the basis of 5 bit quantization of each eigenmode (10 bits total 
feedback per mobile). QBC achieves significantly higher throughput than MRC or antenna selection, particularly 
for larger values of K. The ZF-CSIT curve is shown because it serves as an upper bound on the performance of 
QBC, and the gap between the two is quite reasonable even for B = 10. MET-FB is seen to perform extremely 
poorly: this is not too surprising because the MET algorithm is likely to only choose the strongest eigenmode 
of a few users [18], and thus half of the feedback is essentially wasted on quantization of each user's weakest 
eigenmode. This motivates dedicating all 10 bits to quantization of the strongest eigenmode, but note that this 
essentially corresponds to MRC, which is outperformed by QBC. The huge gap between MET-CSIT and MET-FB 
indicates that MET has the potential to provide excellent performance, but extremely high levels of feedback may 
be necessary to realize MET's potential. 

Finally, Figure [9] shows throughput versus number of users K for a 6 transmit antenna (M = 6) channel with 
either 1 or 2 receive antennas. Sum capacity for N = 1 and N = 2 is plotted, along with the sum rate of a 
perfect-CSIT TDMA system in which only the receiver with the largest point-to-point capacity is selected for 
transmission. The ZF and QBC curves correspond to systems with user selection and either single receive antennas 
or quantization-based combining, respectively, for feedback levels of 10, 15, and 20 bits per mobile. For each 
feedback level, an additional receive antenna with QBC provides a significant throughput gain relative to a single 
receive antenna system. Furthermore, QBC significantly outperforms TDMA (N = 2) for B = 15 or B = 20, and 
provides an advantage over TDMA for B = 10 when the number of users is sufficiently large. Note, however, that 
there is a significant gap between QBC and N = 2 capacity even when 20 bits of feedback are used; this indicates 
that there may be room for significant improvement beyond QBC. 

VII. Conclusion 

The performance of multi-user MIMO techniques such as zero-forcing beamforming critically depend on the 
accuracy of the channel state information provided to the transmitter. In this paper, we have shown that receive 
antenna combining can be used to reduce channel quantization error in limited feedback MIMO downlink chan- 
nels, and thus significantly reduce channel feedback requirements. Unlike traditional maximum-ratio combining 
techniques that maximize received signal power, the proposed quantization-based combining technique minimizes 
quantization error, which translates into minimization of multi-user interference power. 

Antenna combining is just one method by which multiple receive antennas can be used in the MIMO downlink. It 
is also possible to transmit multiple streams to each mobile, or to use receive antennas for interference cancellation 
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Fig. 9. Combining and User Selection: M = 6, N — 1,2 



if the structure of the transmitted signal is known to the mobile. It remains to be seen which of these techniques 
is most beneficial in practical wireless systems when channel feedback resources and complexity requirements are 
carefully accounted for. 



Appendix I 
Proof of Theorem Q] 

Plugging the rate expressions into the definition of A(P), we have A(P) = A a + A5 where 



A a = E u [log 2 (1 + p\h% v ZFjfe | 2 )] - E H ,w 
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where p = To upper bound A , we define normalized vectors 
that the norm and directions of and of h| ff are independent. Using this we have: 



h k /\\h k \\ and hf = hf /||hf ||, and note 
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(24) 



where Xp is (5{M — N + 1, N — 1). Since the BF vector v^F.fc is chosen orthogonal to the (M — 1) other channel 
vectors {h.j}j^ k , each of which is an iid isotropic vector, it is isotropic and is independent of h k . By Lemma |2] the 
same is also true of and hf, and therefore we can substitute |h& vzF,k\ 2 f° r I (hf) v &| 2 - Finally, note that 
the product Xg||h^|| 2 is xl(Af-JV+i) because ||hfc|| 2 is x 2 m> anc ^ therefore Xg||hfc|| 2 and ||hf || 2 have the same 
distribution. Using (l24l we get: 
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where we have used log 2 (Xp) = log 2 ( g X2M — J and results from [8] to to compute E [logo (^s)]- 



, X2(M-JV + 1) , 

Finally, we upper bound A& using Jensen's inequality: 
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Appendix II 
Generation of Numerical Results 

Rather than performing brute force simulation of RVQ, which becomes infeasible for B larger than 15 or 20, 
the statistics of RVQ can be exploited to efficiently and exactly emulate the quantization process: 

1) Draw a realization of the quantization error Z according to its known CDF (Lemma [TJ. 

2) Draw a realization of the corresponding quantization vector according to: 

h fc = ( y/l-z) u + VZs 



where u is isotropic in span(Hfc), s is isotropic in the nullspace of span(Hfc), with u, s independent. 
These steps exactly emulate step 2 of QBC. The same procedure can also be used to emulate antenna selection, 
quantization of the maximum eigenvector, and no combining (N = 1). Because the CDF of the quantization error 
is not known for MRC, MRC results are generated using brute force RVQ. 
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Appendix III 
Rate Gap with Receiver Estimation Error 



We bound the rate gap using the technique of [13]. We first restate the result of Theorem Q] in terms of the 



interference terms E 
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Using the representation of the channel matrix given in (f2TT >. we can write the interference term as: 



(hf)% 
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The first term in the sum is statistically identical to the interference term when there is perfect CSIR, while the 
second term represents the additional interference due to the receiver estimation error. Because the noise and the 
channel estimate are each zero-mean and are independent we have: 
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The first term comes from the perfect CSIR analysis and is equal to the product of and the expected quantization 
error with perfect CSIR. Because ~y k and Vj are each unit norm and is independent of these two vectors, the 
quantity (e k ~/ k ) H Vj is (zero-mean) complex Gaussian with variance (1 + I3P)- 1 , which is less than (1 + (3P)" 1 . 
We finally reach (l22l by using the approximation for quantization error from ( fLST ) and plugging into d25l ). and 
noting that (1 + (3P)- 1 w (fiP)- 1 . 
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